A New Clustering Model Based on Word2vec Mining on Sina Weibo Users’ Tags

نویسندگان

  • Bai Xue
  • Chen Fu
  • Zhan Shaobin
چکیده

Clustering of Weibo users is one of the most important topics in data mining on social network. Clustering can help dig out the relations among people or between people and resources. A lot of work relating to clustering has been done on analyzing personal relationship, whereas we focus our clustering model on preferences and interests. In this article, we propose a new clustering model focusing on users’ tags people choose to describe themselves. First, we will study the characteristics of Sina Weibo tags of users, which are the foundation of the new clustering model. Second, we will use the word2vec tool to cluster Weibo users based on their tags and verify the accuracy of the results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Topical differences between Chinese language Twitter and Sina Weibo

Sina Weibo, China’s most popular microblogging platform, is currently used by over 500M users and is considered to be a proxy of Chinese social life. In this study, we contrast the discussions occurring on Sina Weibo and on Chinese language Twitter in order to observe two different strands of Chinese culture: people within China who use Sina Weibo with its government imposed restrictions and th...

متن کامل

Automatic Hashtag Recommendation in Social Networking and Microblogging Platforms Using a Knowledge-Intensive Content-based Approach

In social networking/microblogging environments, #tag is often used for categorizing messages and marking their key points. Also, since some social networks such as twitter apply restrictions on the number of characters in messages, #tags can serve as a useful tool for helping users express their messages. In this paper, a new knowledge-intensive content-based #tag recommendation system is intr...

متن کامل

Mining the Personal Interests of Microbloggers via Exploiting Wikipedia Knowledge

This paper focuses on an emerging research topic about mining microbloggers’ personalized interest tags from their own microblogs ever posted. It based on an intuition that microblogs indicate the daily interests and concerns of microblogs. Previous studies regarded the microblogs posted by one microblogger as a whole document and adopted traditional keyword extraction approaches to select high...

متن کامل

Impact of Multimedia in Sina Weibo: Popularity and Life Span

Multimedia contents such as images and videos are widely used in social network sites nowadays. Sina Weibo, a Chinese microblogging service, is one of the first microblog platforms to incorporate multimedia content sharing features. This work provides statistical analysis on how multimedia contents are produced, consumed, and propagated in Sina Weibo. Based on 230 million tweets and 1.8 million...

متن کامل

Discovering High-quality Users from Sina Weibo Based on Trust Transfer Model ?

This paper devotes to discovering the high-quality users from Sina microblog (Weibo) which is the most popular microblog site in China. First, the Trust Transfer Model (TTM) is introduced as a theoretical background to make sure that users are trustworthy and high-quality. Then, a Breadth First Search (BFS) crawler based on TTM is implemented to capture users’ profile data via Weibo APIs. There...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014